A fuzzy k-partitions model for categorical data and its comparison to the GoM model

Authors

  • Miin-Shen Yang
  • Yu-Hsuan Chiang
  • Chiu-Chi Chen
  • Chien-Yo Lai
Abstract

The grade of membership (GoM) model uses fuzzy sets to represent each individual's membership in extreme profiles (or classes), based on the likelihood function of multivariate multinomial distributions. The GoM clustering algorithm derived from this model is used in cluster analysis of categorical data, but its iterations involve complicated calculations. In this paper we propose another approach, termed the fuzzy k-partitions (FkP) model, which is also based on the likelihood function of multivariate multinomial distributions. The FkP algorithm for clustering categorical data derived from the proposed model requires simpler calculations than the GoM algorithm, and it is also more accurate and more computationally efficient. To verify this, we use both real empirical data and simulated data, and find that FkP gives superior results to GoM. We then apply the two algorithms to the classification of pathology, and the results again show the superiority of the FkP clustering algorithm. Moreover, the proposed FkP algorithm can be used as a fuzzy clustering algorithm for categorical data. We compare FkP with two popular algorithms, fuzzy k-modes and fuzzy centroids. These results show that the FkP clustering algorithm can be another useful tool for analyzing categorical data. © 2007 Elsevier B.V. All rights reserved.
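
For readers unfamiliar with the likelihood both models build on, the following is a minimal sketch of a multivariate multinomial log-likelihood for one categorical record, with the per-class likelihoods normalized into soft memberships. It is an illustration only, not the FkP or GoM update rules from the paper; the function names, the `profiles` structure, and all probability values are hypothetical, and variables are assumed independent given the class.

```python
import math

def class_log_likelihood(record, profile):
    """Log-likelihood of one categorical record under one class profile.

    profile[j][c] is the (hypothetical) probability that variable j takes
    category c in this class; variables are assumed independent given the class.
    """
    return sum(math.log(profile[j][value]) for j, value in enumerate(record))

def soft_memberships(record, profiles):
    """Normalize per-class likelihoods into memberships that sum to 1."""
    logs = [class_log_likelihood(record, p) for p in profiles]
    peak = max(logs)  # subtract the maximum for numerical stability
    weights = [math.exp(v - peak) for v in logs]
    total = sum(weights)
    return [w / total for w in weights]

# Two made-up classes over two categorical variables (illustrative numbers only).
profiles = [
    [{"a": 0.9, "b": 0.1}, {"x": 0.8, "y": 0.2}],
    [{"a": 0.2, "b": 0.8}, {"x": 0.3, "y": 0.7}],
]
memberships = soft_memberships(("a", "x"), profiles)
```

Here the record `("a", "x")` receives a higher membership in the first class, because that class assigns higher probability to both observed categories.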

Similar resources

A NEW APPROACH FOR PARAMETER ESTIMATION IN FUZZY LOGISTIC REGRESSION

Logistic regression analysis is used to model a categorical dependent variable. It is commonly used in the social sciences and clinical research. Human thought and disease diagnosis in clinical research involve vagueness. This situation leads researchers to combine fuzzy set theory and statistical theory. Fuzzy logistic regression analysis is one of the outcomes of this combination and it is used in situa...

An Empirical Comparison between Grade of Membership and Principal Component Analysis

It is the purpose of this paper to contribute to the discussion initiated by Wachter about the parallelism between principal component (PC) and a typological grade of membership (GoM) analysis. The author tested empirically the close relationship between both analyses in a low-dimensional framework comprising up to nine dichotomous variables and two typologies. Our contribution to the subject is also...

Long-term Streamflow Forecasting by Adaptive Neuro-Fuzzy Inference System Using K-fold Cross-validation: (Case Study: Taleghan Basin, Iran)

Streamflow forecasting plays an important role in water resource management (e.g. flood control, drought management, reservoir design, etc.). In this paper, the Adaptive Neuro-Fuzzy Inference System (ANFIS) is applied to long-term streamflow forecasting (monthly, seasonal), and the K-fold cross-validation method is investigated for evaluating test and training data in the model. Then,...

The exploitation of “Grade of Membership (GoM) model” and “Combined Analysis method” in analysing of urban dynamics and urban hierarchy in Khorasan region (1956-2016)

Concerning the importance of urban network studies and the role of hierarchy system in population decentralization in metropolitan areas, this paper by using acknowledged indicators and Grade of Membership (GoM) model and using Combined Analysis method has tried to find the changes in urban hierarchy of Khorasan region during the period 1956-2016. The outcomes show that urban system in Khorasan...

DIAGNOSIS OF BREAST LESIONS USING THE LOCAL CHAN-VESE MODEL, HIERARCHICAL FUZZY PARTITIONING AND FUZZY DECISION TREE INDUCTION

Breast cancer is one of the leading causes of death among women. Mammography remains today the best technology to detect breast cancer, early and efficiently, to distinguish between benign and malignant diseases. Several techniques in image processing and analysis have been developed to address this problem. In this paper, we propose a new solution to the problem of computer aided detection and...

Journal title:
  • Fuzzy Sets and Systems

Volume 159  Issue 

Pages  -

Publication date 2008